智能论文笔记

An analysis of reconstruction noise from undersampled 4D flow MRI

Lauren Partin , Daniele E. Schiavazzi , Carlos A. Sing Long

分类：计算机视觉

2022-01-11

新磁共振（MR）成像方式可以量化血流动力学，但需要长时间的采集时间，妨碍其广泛用于早期诊断心血管疾病。为了减少采集时间，常规使用来自未采样测量的重建方法，使得利用旨在提高图像可压缩性的表示。重建的解剖和血液动力学图像可能存在视觉伪影。尽管这些工件中的一些基本上是重建错误，因此欠采样的后果，其他人可能是由于测量噪声或采样频率的随机选择。另有说明，重建的图像变为随机变量，并且其偏差和其协方差都可以导致视觉伪影;后者会导致可能误解的空间相关性以用于视觉信息。虽然前者的性质已经在文献中已经研究过，但后者尚未得到关注。在这项研究中，我们研究了从重建过程产生的随机扰动的理论特性，并对模拟和主动脉瘤进行了许多数值实验。我们的结果表明，当基于$ \ ell_1 $ -norm最小化的高斯欠采样模式与恢复算法组合时，相关长度保持限制为2到三个像素。然而，对于其他欠采样模式，相关长度可以显着增加，较高的欠采样因子（即8倍或16倍压缩）和不同的重建方法。

translated by 谷歌翻译

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Teven Le Scao , Angela Fan , Christopher Akiki , Ellie Pavlick , Suzana Ilić , Daniel Hesslow , Roman Castagné , Alexandra Sasha Luccioni , François Yvon , Matthias Gallé

分类：自然语言处理

2022-11-09

Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access language model designed and built thanks to a collaboration of hundreds of researchers. BLOOM is a decoder-only Transformer language model that was trained on the ROOTS corpus, a dataset comprising hundreds of sources in 46 natural and 13 programming languages (59 in total). We find that BLOOM achieves competitive performance on a wide variety of benchmarks, with stronger results after undergoing multitask prompted finetuning. To facilitate future research and applications using LLMs, we publicly release our models and code under the Responsible AI License.

translated by 谷歌翻译

One System to Rule them All: a Universal Intent Recognition System for Customer Service Chatbots

Juan Camilo Vasquez-Correa , Juan Carlos Guerrero-Sierra , Jose Luis Pemberty-Tamayo , Juan Esteban Jaramillo , Andres Felipe Tejada-Castro

分类：自然语言处理

2021-12-15

客户服务Chatbots是对话系统，旨在为客户提供有关不同公司提供的产品/服务的信息。特别地，意图识别是自然语言低估Chatbot系统的能力的核心组件之一。在聊天训练识别的不同意图中，他们有一组是通用的任何客户服务Chatbot。普遍意图可以包括称呼，将对话交给人类代理人，告别。识别这些普遍意图的系统将非常有助于优化特定客户服务聊天训练过程。我们提出了一个普遍意图识别系统的发展，该系统受过培训，以识别28个不同的聊天跳闸中常见的11个意图组。拟议的系统考虑了最先进的单词嵌入模型，例如Word2VEC和BERT，基于卷积和经常性神经网络的深层分类器。所提出的模型能够区分这些普遍意图，均衡精度高达80.4 \％。此外，所提出的系统同样准确地识别短期和长文本请求中表达的意图。同时，错误分类错误通常发生在具有非常相似的语义领域，例如告别和正面评论之间。建议的系统将非常有帮助优化客户服务Chatbot的培训过程，因为我们的系统已经可用并检测到一些意图。与此同时，拟议的方法将是一个合适的基础模型，通过应用转移学习策略培训更具体的聊天措施。

translated by 谷歌翻译

Non-IID data and Continual Learning processes in Federated Learning: A long road ahead

Marcos F. Criado , Fernando E. Casado , Roberto Iglesias , Carlos V. Regueiro , Senén Barro

分类：机器学习

2021-11-26

联合学习是一种新颖的框架，允许多个设备或机构在保留其私有数据时协同地培训机器学习模型。这种分散的方法易于遭受数据统计异质性的后果，无论是在不同的实体还是随着时间的推移，这可能导致缺乏会聚。为避免此类问题，在过去几年中提出了不同的方法。然而，数据可能在许多不同的方式中是异构的，并且当前的建议并不总是确定他们正在考虑的异质性的那种。在这项工作中，我们正式地分类数据统计异质性，并审查能够面对它的最显着的学习策略。与此同时，我们介绍了其他机器学习框架的方法，例如持续学习，也处理数据异质性，并且可以很容易地适应联邦学习设置。

translated by 谷歌翻译

Ensemble learning techniques for intrusion detection system in the context of cybersecurity

Andricson Abeline Moreira , Carlos A. C. Tojeiro , Carlos J. Reis , Gustavo Henrique Massaro , Igor Andrade Brito e Kelton A. P. da Costa

分类：机器学习

2022-12-21

Recently, there has been an interest in improving the resources available in Intrusion Detection System (IDS) techniques. In this sense, several studies related to cybersecurity show that the environment invasions and information kidnapping are increasingly recurrent and complex. The criticality of the business involving operations in an environment using computing resources does not allow the vulnerability of the information. Cybersecurity has taken on a dimension within the universe of indispensable technology in corporations, and the prevention of risks of invasions into the environment is dealt with daily by Security teams. Thus, the main objective of the study was to investigate the Ensemble Learning technique using the Stacking method, supported by the Support Vector Machine (SVM) and k-Nearest Neighbour (kNN) algorithms aiming at an optimization of the results for DDoS attack detection. For this, the Intrusion Detection System concept was used with the application of the Data Mining and Machine Learning Orange tool to obtain better results

translated by 谷歌翻译

LayoutDETR: Detection Transformer Is a Good Multimodal Layout Designer

Ning Yu , Chia-Chih Chen , Zeyuan Chen , Rui Meng , Gang Wu , Paul Josel , Juan Carlos Niebles , Caiming Xiong , Ran Xu

分类：计算机视觉

2022-12-19

Graphic layout designs play an essential role in visual communication. Yet handcrafting layout designs are skill-demanding, time-consuming, and non-scalable to batch production. Although generative models emerge to make design automation no longer utopian, it remains non-trivial to customize designs that comply with designers' multimodal desires, i.e., constrained by background images and driven by foreground contents. In this study, we propose \textit{LayoutDETR} that inherits the high quality and realism from generative modeling, in the meanwhile reformulating content-aware requirements as a detection problem: we learn to detect in a background image the reasonable locations, scales, and spatial relations for multimodal elements in a layout. Experiments validate that our solution yields new state-of-the-art performance for layout generation on public benchmarks and on our newly-curated ads banner dataset. For practical usage, we build our solution into a graphical system that facilitates user studies. We demonstrate that our designs attract more subjective preference than baselines by significant margins. Our code, models, dataset, graphical system, and demos are available at https://github.com/salesforce/LayoutDETR.

translated by 谷歌翻译

A Brief Survey on Person Recognition at a Distance

Chrisopher B. Nalty , Neehar Peri , Joshua Gleason , Carlos D. Castillo , Shuowen Hu , Thirimachos Bourlai , Rama Chellappa

分类：计算机视觉

2022-12-17

Person recognition at a distance entails recognizing the identity of an individual appearing in images or videos collected by long-range imaging systems such as drones or surveillance cameras. Despite recent advances in deep convolutional neural networks (DCNNs), this remains challenging. Images or videos collected by long-range cameras often suffer from atmospheric turbulence, blur, low-resolution, unconstrained poses, and poor illumination. In this paper, we provide a brief survey of recent advances in person recognition at a distance. In particular, we review recent work in multi-spectral face verification, person re-identification, and gait-based analysis techniques. Furthermore, we discuss the merits and drawbacks of existing approaches and identify important, yet under explored challenges for deploying remote person recognition systems in-the-wild.

translated by 谷歌翻译

A Neural Network Warm-Start Approach for the Inverse Acoustic Obstacle Scattering Problem

Mo Zhou , Jiequn Han , Manas Rachh , Carlos Borges

分类：机器学习

2022-12-16

We consider the inverse acoustic obstacle problem for sound-soft star-shaped obstacles in two dimensions wherein the boundary of the obstacle is determined from measurements of the scattered field at a collection of receivers outside the object. One of the standard approaches for solving this problem is to reformulate it as an optimization problem: finding the boundary of the domain that minimizes the $L^2$ distance between computed values of the scattered field and the given measurement data. The optimization problem is computationally challenging since the local set of convexity shrinks with increasing frequency and results in an increasing number of local minima in the vicinity of the true solution. In many practical experimental settings, low frequency measurements are unavailable due to limitations of the experimental setup or the sensors used for measurement. Thus, obtaining a good initial guess for the optimization problem plays a vital role in this environment. We present a neural network warm-start approach for solving the inverse scattering problem, where an initial guess for the optimization problem is obtained using a trained neural network. We demonstrate the effectiveness of our method with several numerical examples. For high frequency problems, this approach outperforms traditional iterative methods such as Gauss-Newton initialized without any prior (i.e., initialized using a unit circle), or initialized using the solution of a direct method such as the linear sampling method. The algorithm remains robust to noise in the scattered field measurements and also converges to the true solution for limited aperture data. However, the number of training samples required to train the neural network scales exponentially in frequency and the complexity of the obstacles considered. We conclude with a discussion of this phenomenon and potential directions for future research.

translated by 谷歌翻译

Biomedical image analysis competitions: The state of current participation practice

Matthias Eisenmann , Annika Reinke , Vivienn Weru , Minu Dietlinde Tizabi , Fabian Isensee , Tim J. Adler , Patrick Godau , Veronika Cheplygina , Michal Kozubek , Sharib Ali

分类：计算机视觉 | 机器学习

2022-12-16

The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis, we designed an international survey that was issued to all participants of challenges conducted in conjunction with the IEEE ISBI 2021 and MICCAI 2021 conferences (80 competitions in total). The survey covered participants' expertise and working environments, their chosen strategies, as well as algorithm characteristics. A median of 72% challenge participants took part in the survey. According to our results, knowledge exchange was the primary incentive (70%) for participation, while the reception of prize money played only a minor role (16%). While a median of 80 working hours was spent on method development, a large portion of participants stated that they did not have enough time for method development (32%). 25% perceived the infrastructure to be a bottleneck. Overall, 94% of all solutions were deep learning-based. Of these, 84% were based on standard architectures. 43% of the respondents reported that the data samples (e.g., images) were too large to be processed at once. This was most commonly addressed by patch-based training (69%), downsampling (37%), and solving 3D analysis tasks as a series of 2D tasks. K-fold cross-validation on the training set was performed by only 37% of the participants and only 50% of the participants performed ensembling based on multiple identical models (61%) or heterogeneous models (39%). 48% of the respondents applied postprocessing steps.

translated by 谷歌翻译

A Deep Double Ritz Method for solving Partial Differential Equations

Carlos Uriarte , David Pardo , Ignacio Muga , Judit Muñoz-Matute

分类：机器学习

2022-11-07

Residual minimization is a widely used technique for solving Partial Differential Equations in variational form. It minimizes the dual norm of the residual, which naturally yields a saddle-point (min-max) problem over the so-called trial and test spaces. Such min-max problem is highly non-linear, and traditional methods often employ different mixed formulations to approximate it. Alternatively, it is possible to address the above saddle-point problem by employing Adversarial Neural Networks: one network approximates the global trial minimum, while another network seeks the test maximizer. However, this approach is numerically unstable due to a lack of continuity of the text maximizers with respect to the trial functions as we approach the exact solution. To overcome this, we reformulate the residual minimization as an equivalent minimization of a Ritz functional fed by optimal test functions computed from another Ritz functional minimization. The resulting Deep Double Ritz Method combines two Neural Networks for approximating the trial and optimal test functions. Numerical results on several 1D diffusion and convection problems support the robustness of our method up to the approximability and trainability capacity of the networks and the optimizer.

translated by 谷歌翻译